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Abstract 

During normal postnatal mammary gland development and adult remodeling related to the menstrual 
cycle, pregnancy, and lactation, ovarian hormones and peptide growth factors contribute to the delineation 
of a definite epithelial cell identity. This identity is maintained during cell replication in a heritable but DNA- 
independent manner. The preservation of cell identity is fundamental, especially when cells must undergo 
changes in response to intrinsic and extrinsic signals. The maintenance proteins, which are required for cell 
identity preservation, act epigenetically by regulating gene expression through DNA methylation, histone 
modification, and chromatin remodeling. Among the maintenance proteins, the Trithorax (TrxG) and 
Polycomb (PcG) group proteins are the best characterized. In this review, we summarize the structures 
and activities of the TrxG and PcG complexes and describe their pivotal roles in nuclear estrogen receptor 
activity. In addition, we provide evidence that perturbations in these epigenetic regulators are involved in 
disrupting epithelial cell identity, mammary gland remodeling, and breast cancer initiation. 
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It is currently hypothesized that breast carcinogenesis proceeds 
through sequential stages, beginning with the dysregulation of normal 
epithelial cell proliferation; progressing through atypical hyperplasia, 
carcinoma in situ, and invasive carcinoma; and finally progressing 
to metastatic disease that may have lethal consequences for the 
patient' 11 . Although accumulating evidence indicates that committed 
mammary progenitor cells are the bona fide site of breast cancer 
initiation 12 3] , compelling experimental studies have suggested that 
terminally differentiated cells could also acquire malignant features 
(mobility, invasiveness, and resistance to apoptosis) via epithelial-to- 
mesenchymal transition (EMT)' 4,51 . 

Normal human mammary glands consist of 6-12 independent 
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ductal systems that form the ductal tree. Each ductal system 
is composed of a central duct, some peripheral branches, and 
associated glandular tissue. The entire ductal tree is surrounded by 
the stroma, a heterogeneous population of cells (fibroblasts, adipose 
cells, endothelial precursors, and immune system cells) embedded in 
an extracellular matrix that provides a substrate for cell adhesion and 
acts as a reservoir of growth factors 161 . 

Studies in mice have revealed that each peripheral branch is 
capped by a terminal ductal lobulo-alveolar unit (TDLU) composed 
of a bilayered and polarized epithelium encompassing a luminal 
space. The inner luminal epithelium is formed by holocrine cells 
that possess a specialized apico-basolateral polarity, whereas the 
outer basal epithelium or myoepithelium is composed of contractile 
cells that form a basket-like network around the alveoli and ducts' 71 . 
Myoepithelial cells synthesize the basement membrane and act as an 
active structural barrier between the luminal epithelial cells and the 
surrounding intra-lobular stroma' 81 . 

Luminal and myoepithelial cells can be distinguished using 
cell type-specific markers. Luminal epithelial cell-specific markers 
include E-cadherin; epithelial cell adhesion molecule (EpCAM); 
mucin; cytokeratins 7, 8, 18, and 19; and CD24. Myoepithelial cell- 
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specific markers include smooth muscle actin (aSMA); membrane 
metallopeptidase (also known as CD10/CALLA); vimentin; 
cytokeratins 5/6, 14, and 17; and epidermal growth factor receptor 
(EGFR). 

Estrogen and the Mammary Gland 

The development of the mammary gland, unlike that of other 
human organs, is completed only postnatally when, in response to 
ovarian hormones (estrogen and progesterone), extensive TDLU 
proliferation occurs, resulting in branch extension into the surrounding 
stroma and lobule differentiation. The complete functional 
development of the lobules occurs only with full-term pregnancy 
and subsequent lactation, at the end of which the lobules involute 
but retain a larger number of individual alveoli per lobule than were 
present before pregnancy' 9,101 . 

Estrogen receptor and coregulators 

Estrogen and progesterone exert their biological functions 
through specific ligand-activated nuclear receptors. Estrogen 
bind to two distinct estrogen receptors (ERa and ER(3), which are 
encoded by genes located on two different chromosomes, whereas 
progesterone binds to two isoforms of a receptor (PR-A and PR-B) 
that are transcribed from the same gene by different promoters' 11,121 . In 
its unliganded state, ER is combined with chaperone proteins from 
which it dissociates upon hormone binding' 131 . Thereafter, the newly 
formed ligand-receptor complex dimerizes and translocates into the 
nucleus, where it can bind directly to specific sequences, termed 
estrogen response elements (EREs), located within the promoter/ 
enhancer regions of their target genes. This complex can also 
affect gene transcription indirectly through its physical interaction 
with other transcription factors, such as E2F1 , activating protein 1 
(AP1), Sp1, Stat3, and nuclear factor-KB (NF-kB)' 14,151 (Figure 1). In 
addition to ligand availability, ER transcriptional activity depends on 
the cooperation of specific proteins, collectively called "coregulators," 
that act as coactivators or compressors' 161 . ER coactivators, which 
include steroid receptor coactivator (SRC)/p160 family members, E3 
ubiquitin-protein ligases, and the histone acetyltransferase cAMP 
responsive element binding protein (CREB)-binding protein (CBP)/ 
p300, are recruited directly by ER and enhance ER-mediated gene 
expression; in breast cancer, the best studied coactivator is amplified 
in breast cancer (AIB)-1/SRC-3 [1?1 . Conversely, corepressors inhibit 
gene expression through a direct interaction with unliganded ER 
or competition with coactivators for ER binding. Most corepressors 
are widely distributed in human tissues, and the corepressors 
that are best characterized in breast cancer are nuclear receptor 
corepressor (N-CoR) and silencing mediator of retinoid and thyroid 
hormone receptors (SMRT)' 18,191 . These coregulators are not merely 
"bridging" agents linking ER and the transcription machinery; rather, 
they perform a wide range of enzymatic activities (e.g., acetylation, 
methylation, and ubiquitination) that are essential for the appropriate 
regulation of the transcription process as part of multicomponent 
protein complexes. Because of their roles in hormone-dependent 



gene transcription, it is not surprising that coregulator misexpression 
is a common feature of endocrine-related tumors including breast, 
uterine, ovarian, and prostate cancers. 

Estrogen receptor and pioneer factors 

Estrogen receptor and FoxA 1 

Among the proteins that facilitate the ER-DNA interaction, 
"pioneer factors" represent a special class of proteins that can 
associate directly with condensed chromatin independent of 
and prior to any other transcription factors, allowing chromatin 
accessibility' 201 . Several protein families, including Forkhead box 
A (FoxA)' 211 , transducin-like enhancer (TLE) [22] , and, to a moderate 
degree, Gata' 231 , have been proved to possess this "pioneer" property. 
Originally described as crucial transcriptional components during 
development and differentiation, pioneer factors have recently been 
implicated as mediators of nuclear receptor function under both 
normal and pathologic conditions' 241 . In particular, FoxA1 has been 
proved to be a master regulator of ER activity due to its ability to bind 
to DNA and core histones simultaneously, to rearrange chromatin 
structure by disrupting local internucleosomal interactions and to 
recruit ER to target gene promoters' 25,261 (Figure 2). The unique 
"pioneer" capability of FoxA1 is due to the presence of a so-called 
forkhead box domain, a variant of the helix-turn-helix structure that 
has two large loops, giving it the appearance of a "winged helix"' 271 . 
Experimental evidence has indicated that FoxA1 moves slowly along 
the chromatin, scanning it for enhancers with forkhead motifs' 281 . 
Upon finding a forkhead motif, the central helix-turn-helix of FoxA1 
makes direct contact with the major groove of the DNA, while each 
wing interacts with minor grooves adjacent to the target sequence, 
thus stabilizing the interaction with the DNA' 291 and triggering the 
transcriptional competency of the enhancer. 

Despite its recognized role as a pioneer factor, the binding 
capability of FoxA1 can be enabled or restricted by certain chromatin 
modifications, such as methylation and acetylation. Due to the 
lack of a classic methylated GC sequence in the forkhead box 
domain, FoxA1 -binding activity occurs primarily at hypomethylated 
DNA regions. Consequently, heavily methylated DNA regions may 
have relatively poor FoxA1 -binding potential 1301 . In addition to DNA 
hypomethylation, FoxA1 -binding activity depends on the methylation 
status of histone H3, specifically on the mono- and di-methylation 
of histone H3 at lysine 4 (H3K4me1 and H3K4me2). Experimental 
evidence has indicated that the demethylation of these histone 
sites inhibits FoxA1 binding 1311 , suggesting that H3K4 methylation 
is necessary to stabilize FoxA1 binding and allow the subsequent 
recruitment of transcriptional regulators, including sex steroid 
hormone nuclear receptors. The functional relationship between 
FoxA1 pioneering activity and ER-binding activity is corroborated 
by the observation that, in the developing mammary gland, FoxA1 
and ER are co-expressed within luminal epithelial cells, especially 
at the terminal end buds, and that a FOXA1 gene deletion results in 
the loss of ER expression' 32 '. Because luminal progenitors give rise 
to the ductal lineage, FOXA1 deletion and the consequent loss of 
ER expression may cause the arrest of duct elongation and lobule 
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Figure 1 . The estradiol (E)-estrogen receptor (ER) 
complex may activate gene transcription directly by 
binding to specific sequences (estrogen response 
elements, EREs) within the promoter/enhancer regions 
of target genes or indirectly by physical interaction with 
other transcription factors, such as activating protein 
1 (AP1). CBP, histone acetyttransferase CREB-binding 
protein; SRC-1, steroid receptor coactivator-1 ; AIB1, 
amplified in breast cancer 1 . 




Figure 2 . Forkhead box A1 (FoxA1) is a 
pioneer factor belonging to a special class 
of proteins that are able to interact directly 
with condensed chromatin prior to any 
other transcription factors. Upon chromatin 
binding, FoxA1 induces a nucleosomal 
rearrangement that results in a more relaxed 
chromatin structure, which is accessible to 
other transcription regulators, including ERs. 
Green diamonds represent epigenetic histone 
modifications (dimethylation of histone H3 at 
lysine 4, H3K4me2). 



differentiation in response to hormones during puberty or pregnancy. 

Estrogen receptor and Gata-3 

As a member of the Gata family, Gata-3 has long been 



recognized as essential for mammary gland development and 
maintenance of the terminal differentiation of the luminal lineage 
in adults 133,341 . More recently, Gata-3 has been proven to function 
as a pioneer factor. Co-recruited with FoxA1 to ER cis-regulatory 
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elements, Gata-3 is essential for the ER-mediated transcription 
of target genes 1351 . More specifically, experimental evidence has 
indicated that ER and Gata-3 participate in a positive feedback loop 
in which ER expression is required for GATA3 gene transcription' 35 '. 
This finding supports that the loss of Gata-3 in the normal mammary 
gland decreases the ER-expressing luminal population' 33,34 '. 

Dissociation between ER expression and cell proliferation 
In addition to ovarian hormones, several other factors 
orchestrate the development of the human mammary gland via 
stroma-epithelium interactions' 36 " 38 '. In fact, ER and PR are expressed 
only in approximately 30% of luminal cells, and basal cells do 
not express hormone receptors at all. Furthermore, in the normal 
mammary gland, ER-positive cells are quiescent and act as "steroid 
hormone sensors," producing paracrine/juxtacrine factors to regulate 
the proliferative activity of adjacent ER-negative epithelial cells. This 
phenomenon is known as dissociation between steroid receptor 
expression and cell proliferation and aims to prevent the overgrowth 
of ER-positive mammary epithelial cells in an antitumor strategy' 39 " 42 '. 
The observation that 70% of primary breast cancers are ER-positive, 
in contrast to normal mammary glands, strongly suggests that this 
control mechanism is disrupted during breast tumorigenesis and that 
the enhanced activity of the ER signaling pathway then becomes the 
major autocrine force driving tumor growth' 431 . 

Maintenance Proteins 

During mammary gland development, ovarian hormones and 
peptide growth factors act in coordination to define a profile that 
depends on the differential expression of genes that are specifically 
involved in epithelial cell differentiation. This tissue-specific profile 
is maintained during cell replication so that the daughter cells will 
retain the cell type identity of the parental cell (cellular memory), even 
when cell activity must change in response to intrinsic and extrinsic 
signals. The preservation of cell identity (i.e., the maintenance of 
a specific gene expression profile) depends on a class of proteins 
that are collectively termed maintenance proteins' 441 . Previous 
studies have shown that maintenance proteins act epigenetically 
by regulating gene expression in a heritable fashion, independent 
of DNA sequence. This epigenetic control is achieved through DNA 
methylation, histone modification, and chromatin remodeling' 45,46 '. 
Trithorax (TrxG) and Polycomb (PcG) group proteins are the best 
characterized maintenance proteins' 47,48 '. 

Discovered in Drosophila, the TrxG and PcG proteins 
play a crucial role in the epigenetic control of a large number of 
developmental genes. Genetic studies have established that in 
Drosophila, the TrxG and PcG proteins exert an antagonistic action 
on the transcription of homeotic (HOX) genes, a cluster of genes 
that defines the positional identity of the body segments along the 
anterior-posterior axis of the fly during early embryogenesis' 49 '. PcG 
proteins act as transcriptional repressors that maintain an "off state," 
whereas TrxG proteins are transcriptional activators that maintain 
an "on state." Mutations in specific PcG and TrxG genes generate 
flies with homeotic transformations due to the misregulation of 



developmental master Hox genes' '. 

Additional studies performed in other organisms have revealed 
that TrxG and PcG proteins can act as transcriptional activators or 
repressors depending on the multi-protein complex in which they 
are included, and these proteins have been implicated in a variety 
of normal cellular processes, including genomic imprinting' 51 ', X 
chromosome inactivation' 52 ', stem cell identity' 53 ', and cell prolifera- 
tion' 54 '. 

TrxG complexes 

Based on the function of their key component, the TrxG multi- 
protein complexes are classified as ATP-dependent chromatin- 
remodeling complexes that regulate gene transcription in an energy- 
dependent manner or covalent histone-modifying complexes [55,56 '. 

ATP-dependent chromatin- remodeling complexes 

In all ATP-dependent chromatin-remodeling complexes, the 
central catalytic subunit belongs to the sucrose non-fermenting 2 
(SNF2) family of ATPases; through this subunit, the complex uses the 
energy of ATP hydrolysis to fuel nucleosome translocation along the 
DNA, histone H2A/H2B ejection from the nucleosome, entire histone 
octamer relocation, and histone H2A variant exchange' 57,58 '. According 
to the domain architecture of the catalytic subunit, the complexes 
are classified into seven families, four of which are preserved in all 
mammalian cells. In humans, the most prominent of these complexes 
are the switching-defective (SWIJ/SNF and imitation switch (ISWI) 
complexes' 59 '. 

SWI/SNF complex: structure and function 

A SWI/SNF complex is composed of a central ATPase subunit, 
which can be either brahma (BRM) or BRM-related gene 1 (BRG1), 
and 10-12 proteins, known as BRG1 -associated factors (BAFs). 
Despite their high overall homology (approximately 75% identity at 
the amino acid level), the BRG1 and BRM sequences diverge at the 
N-terminal region; these sequences regulate their binding to different 
transcription factors, and they form mutually exclusive SWI/SNF 
complexes. Consequently, the SWI/SNF complex may play different 
and sometimes antagonistic roles depending on whether BRG1 or 
BRM is the central catalytic subunit' 60 '. 

SWI/SNF complexes may exist in two forms, SWI/SNF-BAF 
or SWI/SNF-polybromo-associated BAF (PBAF), depending on 
their BAF composition. Both forms contain a highly conserved 
"core" of BRG1 (sometimes BRM), Baf47/hSNF5/INI1, Baf1 55, and 
Baf170, and the additional subunits Baf53, Baf57, Baf60, and P-actin 
(Figure 3). The two forms are distinguished by the presence of 
specific subunits: Baf250 in SWI/SNF-BAF or Baf180, Baf200, and 
bromodomain-containing 7 (Brd7) in SWI/SNF-PBAF' 61 '. 

Several models have been postulated to explain how SWI/ 
SNF modifies chromatin structure, especially the ATP-dependent 
mobilization of nucleosomes in cis along the DNA and the ejection 
and insertion of histone octamers' 62 '. For nucleosome sliding, it has 
been proposed that after binding to nucleosomal DNA, the SWI/SNF 
complex disrupts histone-DNA association and makes DNA molecule 
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Figure 3. Brahma-related gene 1 (BRGI)-containing chromatin-remodeling complexes. In addition to the switching-defective (SWI)/sucrose non- 
fermenting (SNF)-BRGI-associated factors (BAF) and SWI/SNF-polybromo-associated BAF (PBAF) complexes, the catalytic subunrt BRG1 (or, less 
frequently, BRM) can associate with several chromatin-remodeling complexes, including transcription coactivators and co repressors. BRG1 can be 
found in complexes with transcription coactivators and histone-modifying enzymes such as Williams syndrome transcription factor (WSTF)— including 
nucleosome assembly complex (WINAC) and nucleosomal methylation activation complex (NUMAC). Conversely, BRG1 can be assembled in complexes 
known to repress transcription and induce gene silencing, such as the nuclear receptor co repressor (N-CoR) and mSin3A/histone deacetylase (HDAC) 
complexes. Orange: BRG1 subunrt; gold: essential subunits; green: nuclear receptor-associated factors; red: DMA replication factors; yellow: actin- 
related complexes; blue: factors with other functions. 



more accessible to DNA-binding factors. However, the mechanism 
by which nucleosome ejection and insertion occur has not been 
completely elucidated. Recent findings suggest that histone ejection 
may occur not at the nucleosome that is directly bound by the SWI/ 
SNF complex but rather at the adjacent nucleosome following 
the repositioning of the bound nucleosome' 631 . Regardless of the 
mechanism, the change in chromatin structure induced by the SWI/ 
SNF complex allows a large number of transcription factors, including 
nuclear steroid receptors, to bind to DNA efficiently' 64 " 661 . 

Although the SWI/SNF complex in S. cerevisiae was identified 
based on its role in the activation of transcription, experimental 
evidence indicates that mammalian SWI/SNF complexes contribute 
to both gene repression and activation. In embryonic stem 
cells, for instance, BRG1 may act as a transcription repressor 
of genes associated with differentiation and an enhancer of the 
core pluripotency transcriptional network' 67 '. Similarly, the forced 



overexpression of BRG1 enhances the reprogramming of adult 
fibroblasts into induced pluripotent stem cells by facilitating Oct4 
binding to target genes' 681 . 

It was initially assumed that the whole SWI/SNF complex 
was required for chromatin remodeling, but biochemical studies 
have indicated that both BRG1 and BRM may have remodeling 
activity independent of other SWI/SNF subunits. The addition of 
core subunits (Baf47, Baf1 55, and Baf1 70) is required to achieve 
chromatin-remodeling activity at near-optimal levels' 69 ', whereas non- 
essential subunits (Baf53, Baf57 BafoO, and fi-actin) aid transcription 
regulation. In particular, Baf57 is required for hormone steroid 
receptor activity' 70 ', whereas BAF- and PBAF-specific subunits 
(Figure 3) are essential for further specifying SWI/SNF complex 
activity. Baf250a and Baf250b, for instance, are mutually exclusive, 
and the corresponding Baf250a- and Baf250b-associated complexes 
have antagonistic effects on cell cycle progression, with Baf250a 
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participating in the repression and Baf250b in the induction of key 
cell cycle regulators" 11 . In addition, Baf250b contributes to the 
proper expression of genes involved in embryonic stem cell self- 
renewal' 721 . Conversely, the PBAF-specific subunit Bafl 80 is a critical 
modulator of p21 Cip1 induction and acts as a negative regulator of cell 
proliferation' 731 . 

SWI/SNF complex mutations and malignant 

transformation 

Because of the essential role of the SWI/SNF complex in 
various and unrelated pathways, it is not surprising that mutations 
in one or more subunits of the complex can be associated with 
malignant transformation' 741 . Inactivating mutations in the Baf47 
subunit, for instance, are present in the majority of malignant 
rhabdoid tumors, a class of extremely aggressive and lethal cancers 
of early childhood that arise primarily in the kidney, brain, and other 
soft tissues' 751 . Experimental studies in stable human cell lines 
generated from malignant rhabdoid tumors have indicated that Baf47 
acts as tumor suppressor and that its forced re-expression leads 
to cell cycle arrest associated with increases in p16 tNK4a , E2F1, and 
cyclin D expression' 76 '. Other SWI/SNF subunits are believed to 
function as tumor suppressors. Baf155 and Baf250a have recently 
been associated with a loss of tumor suppressor activity in human 
cancer cell lines' 77 ' and gynecologic cancers' 78 ', whereas ARID1A, 
the gene encoding the Baf250a subunit, has been found to be 
altered in approximately 10% of breast carcinomas' 79 '. A similar 
negative activity on cell proliferation has been described for the 
Bafl 80 subunit, mutations in which have been identified in a subset 
of breast cancers |73 '. However, the most common cancer-related 
changes in the SWI/SNF complex are the loss or inactivation of the 
BRG1 or BRM subunit, which occur in a variety of tumors, including 
breast cancer' 80,81 '. The genetic or epigenetic silencing of the BRG1 
subunit is of particular relevance considering that, in addition to 
the BAF/PBAF complexes, BRG1 is the pivotal component in 
several other chromatin-remodeling and histone-modifying enzyme 
complexes, including the Williams syndrome transcription factor 
(WSTF)-including nucleosome assembly complex (WINAC), 
N-CoR, nucleosomal methylation activation complex (NUMAC), 
and mSin3A/histone deacetylase (HDAC) complex' 82 " 84 ' (Figure 3). 
In particular, when associated with coactivator-associated arginine 
methyltransferase-1 (CARM1) to form the NUMAC complex, BRG1 
can recruit nuclear hormone receptors to EREs and cooperatively 
activate estrogen-dependent gene transcription' 85 " 87 '. Notably, in the 
presence of an estrogen antagonist, BRG1 -containing complexes 
undergo a reversal of functional activity and act as corepressors, 
recruiting HDAC1, CBP/p300, and prohibitin and significantly 
inhibiting promoter activation' 88 '. 

ISWI complex: structure and function 

The second most prominent ATP-dependent chromatin- 
remodeling complex is ISWI, the human ortholog of the Drosophila 
nucleosome-remodeling factor (NURF) complex' 83 '. ISWI is composed 
of four subunits: a catalytic subunit (SNF2 ligand, SNF2L), a histone- 
binding component [bromodomain plant homeo domain (PHD)- 



finger transcription factor (BPTF)], and two highly homologous 
retinoblastoma-associated proteins, RbAp46 and RbAp48. SNF2L is 
the energy-transducing component, and BPTF specifically recognizes 
histone H3 tails that are trimethylated at lysine 4 (H3K4me3), which 
are present at the transcription start sites of virtually all active 
genes' 90 '. RbAp46 and RbAp48 are involved in histone acetylation' 91 '. 
In mammary tissue in particular, RbAp46 and RbAp48 have opposing 
effects on the expression of estrogen-repressed genes; RbAp46 
increases the expression of repressed genes, whereas RbAp48 
decreases their expression' 92 '. Because the ability of RbAp48 to inhibit 
gene expression is most evident in the absence of an ER ligand, it 
has been proposed that the primary role of this protein might be in 
the control of estrogen-repressed gene expression in a hormone- 
free environment. Conversely, the RbAp46 protein should control 
estrogen-responsive gene expression in the presence of ovarian 
hormones' 93 '. 

Although the ISWI complex has been studied extensively in 
normal brain development and neurodegenerative diseases' 94 ', little is 
known about its functional importance in cancer. However, emerging 
evidence indicates that the inhibition of SNF2L expression may have 
dramatic effects on the viability of different cancer cell lines' 95 ' and 
that BPTF deregulation due to a translocation breakpoint may confer 
a cancer-promoting phenotype in these cells' 96 '. In addition, RbAp46 
and RbAp48 dysregulation has been linked to carcinogenesis in 
various tissues, including breast tissue' 971 . 

Covalent histone-modifying complexes 

Covalent histone-modifying complexes catalyze the addition 
and removal of post-translational modifications (e.g., methylation, 
acetylation, phosphorylation, and ubiquitination) on histone pro- 
teins' 5698 '. By enhancing or reducing chromatin condensation, these 
modifications affect the binding affinity between histones and DNA, 
thus enhancing or preventing gene transcription. 

Histone methyltransferases: structure and function 

Among the proteins within histone-modifying complexes, histone 
methyltransferases (HMTs) are histone-modifying enzymes that 
specifically catalyze the transfer of one, two, or three methyl groups 
to lysine and arginine residues of the histone tail, predominantly 
those of histones H3 and H4' 99 '. For this reason, they are sometimes 
described as "epigenetic writers." H3K4 methylation is an 
evolutionarily conserved mark, and the degree of this methylation is 
biologically significant because proteins that interact with methylated 
histones are able to distinguish between mono-, di-, and trimethylated 
lysines. Based on their target residues, HMTs can be subdivided 
into protein arginine methyltransferases (PRMTs) and protein lysine 
methyltransferases (PKMTs). PKMTs are further subdivided into 
su(var)3-9, enhancer of zeste, trithorax (SET) domain-containing, and 
non-SET domain-containing proteins. All members of the TrxG family 
with HMT activity contain a SET domain, and in concert with the 
ISWI class of ATP-dependent chromatin-remodeling enzymes, these 
proteins recognize disrupted nucleosomal structure and methylate 
H3K4. 

Represented by a single member in yeast, PKMTs diverged and 
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developed increased structural and functional complexity in higher 
eukaryotes 11001 . Humans express at least eight PKMTs, including 
several members of the TrxG family. MLL1 , the founding member 
of the PKMT family, was discovered in 1991 upon the cloning of the 
gene involved in human leukemias carrying chromosome band 11q23 
translocations, hence the name mixed lineage leukemia (MLL)' 1011 . 
Subsequent biochemical and genetic studies provided evidence for 
the activity of MLL1 in a wide range of physiologic and pathologic 
cellular processes, including cell proliferation and differentiation, 
through the regulation of Hox and non-Hox gene expression 1102 " 1041 . 
Since the discovery of MLL1 , four additional members, MLL2-MLL5, 
have been identified. Although their detailed mechanisms of action 
are still largely unknown, some of these proteins interact with nuclear 
receptors, including ER, to coordinate hormone-dependent gene 
regulation 1105 ' 1061 . 

Because of their widespread roles in gene regulation, it is 
not surprising that changes in the expression of MLL genes due 
to deletion or truncation may have detrimental effects on crucial 
physiologic processes and may be associated with human diseases, 
including cancers. The MLL3 gene, for example, is frequently 
mutated in many types of human solid tumors, including breast 
cancer, approximately 40% of which show reduced MLL3 protein 
expression 11071 . A similar oncosuppressive activity has been suggested 
for MLL5, which has been implicated in myeloid leukemia 11081 . 
Biochemical and genetic studies have established that MLL5 acts as 
an important cell cycle regulator. MLL5 overexpression inhibits cell 
cycle progression by suppressing the inappropriate expression of S 
phase-promoting genes and inducing cell cycle arrest at both the 
and G 2 /M phases 11091 . 

In addition to the MLL catalytic subunit, all PKMT complexes 
contain a common accessory subunit core formed by five subunits: 
Drosophila Ash2-like (Ash2L), retinoblastoma-binding protein 5 
(Rbbp5), WD repeat domain 5 (Wdr5), mammalian Dpy-30 homolog 
(mDpy-30), and CpG-binding protein (CGBP) 1111M141 (Figure 4). The 
currently available evidence indicates that these core subunits 
interact cooperatively to form a structural platform, which is essential 
for MLL-mediated histone methylation' 1151 . Ash2L regulates catalytic 
activity, whereas the multifunctional Wdr5 subunit serves as a binding 
adaptor for the MLL-SET domain to dimethylated histone H3K4 
(H3K4me2), which is subsequently converted to the trimethylated 
form (H3K4me3). Genetic studies in embryonic stem cells have 
shown that the depletion of a single core subunit, such as mDpy-30, 
may alter the H3K4-specific methyltransferase activity of the complex 
and consequently misregulate the "bivalent domain" status of genes 
involved in stem cell fate 11161 . "Bivalent domains" are pivotal chromatin 
structures composed of both repressive (H3K27 methylation) and 
activating (H3K4 methylation) marks that silence developmental 
genes while preserving stem cells' ability to be activated in response 
to appropriate developmental cues. Therefore, changes in H3K4- 
specific methyltransferase activity may strongly affect embryonic 
stem cell fate specification 11171 . 

In higher eukaryotes, the subunit core can associate with a 
broad spectrum of accessory proteins that coordinate the crosstalk 
between PKMTs and other enzymes and coregulators involved 



in histone modification (e.g., acetylation, deacetylation, and 
demethylation), as well as some components of SWI/SNF chromatin- 
remodeling complexes' 1181 . The intricate interrelation among HMT 
complexes, ATP-dependent chromatin-remodeling complexes, 
and the nuclear ER is particularly relevant to mammary gland 
development and differentiation. Studies have demonstrated that 
upon estrogen stimulation, the MLL2-associated HMT complex is 
recruited to the promoters of ER target genes along with ligand- 
bound ER, thereby activating transcription 11051 . In addition to its direct 
interaction with MLL2, ER may interact with the HMT complex via 
other proteins, such as Menin and activating signal cointegrator-2 
(ASC-2). It has been demonstrated that Menin, the product of the 
multiple endocrine neoplasia type 1 (MEN1) tumor suppressor gene, 
interacts with ER directly through its activation function 2 (AF2) 
domain and acts as a coregulator, recruiting the MLL2 complex to the 
promoters of estrogen-responsive genes 1119,1201 (Figure 4). Similarly, 
ASC-2 participates actively in the transactivation of several nuclear 
receptors including ER as part of a multi-protein complex, termed 
ASCOM (for ASC-2 complex), that includes MLL3 or MLL4 as its 
catalytic subunit. 

In addition to ASC-2, Baf47/hSNF5/INI1 , a core subunit of SWI/ 
SNF chromatin-remodeling complexes, has been shown to interact 
directly with MLL3 and MLL4, suggesting a functional crosstalk 
between chromatin-remodeling (specifically, SWI/SNF) and covalent 
histone-modifying complexes (specifically, ASCOM) that promotes 
the binding of both complexes to ER target genes. Because of their 
roles as cofactors in estrogen-dependent gene transcription, it is not 
surprising that a change in the expression of Menin or ASC-2 protein 
may contribute to neoplastic transformation. Menin is overexpressed 
in the breast cancer cell line MCF-7, colocalizes with ER, and 
functions as a direct coactivator of ER-mediated transcription 11201 . 
Similarly, ASC-2 has been found to be amplified and overexpressed 
in breast, colon, and lung cancers. Further experimental evidence 
suggests that ASC-2 may interact with several mitogenic transcription 
factors, including AP-1 and NF-kB, and promote cell proliferation 
through the enhancement of E2F1 -dependent transactivation 1121,1221 . 

Histone demethylases: structure and function 

In addition to PKMTs, which act as epigenetic writers, TrxG 
proteins include some elements deputed to histone demethylation 
(epigenetic erasers). For many years, histone methylation was 
believed to be an irreversible modification. On the contrary, recent 
studies have demonstrated that histone methylation can be 
dynamically regulated through active demethylation. In particular, 
genetic studies in Drosophila have demonstrated that little imaginal 
discs (Lid), a Jumonji C domain-containing TrxG protein, can 
demethylate the trimethylated form of H3K4, thus removing the 
activation marks and facilitating gene silencing 11231 . The four human 
orthologs of the Lid protein (RBP2/JARID1A, PLU-1/JARID1B, SMCX, 
and SMCY) belong to the JARID1/KDM5 protein family and show 
high sequence conservation. Despite the sequence similarity and the 
putative functional redundancy among these proteins, experimental 
evidence indicates that individual members have unique functional 
properties and divergent expression profiles. Retinoblastoma-binding 
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Figure 4. Mixed lineage leukemia proteins (MLLs) are the catalytic subunits of covalent histone-modifying complexes and have H3K4- 
specific methyltransferase activity. According to a proposed working model, Menin interacts directly with ER through its activation function 
2 (AF2) domain, recruiting the MLL complex to the ERE of the target gene. In multi-protein complexes that include MLL3 or MLL4 as the 
catalytic subunrt, the activating signal cointegrator-2 (ASC-2) protein acts as a coactivator in the transactivation of the nuclear estrogen 
receptor. This crosstalk functions at least in part by mutually facilitating the recruitment of the MLL and ER complexes to ER target genes. 



protein 2 (RBP2), the first identified and best characterized member 
of the JARID1 family, controls two groups of genes, one involved in 
cellular differentiation and one implicated in mitochondrial function 
and RNA/DNA metabolism 11241 . More specifically, it is positively 
involved in the transcriptional regulation of Hox proteins during cell 
differentiation and in steroid receptor-mediated transcription 1125,1261 . 
Very recent experimental findings have implicated RBP2 in the 
positive regulation of PR expression in the ER-positive breast cancer 
cell line MCF-7, showing that RBP2 removes the H3K4 methylation 
mark from the ERE downstream of the PR transcription start site 11271 . 
PLU-1 , the second member of the family, was originally identified in 
human breast cancer cell lines' 128 ', is required for cell proliferation in 
the mammary gland' 129 ', and causes the down-regulation of tumor 
suppressor genes, such as BRCA1, in ER-positive breast cancer 
cells' 130 '. Less investigated than RBP-2 and PLU-1 are Smcy homolog 
X-linked (SMCX) and Smcy homolog Y-linked (SMCY), encoded by 
two genes located on chromosome X and Y, respectively. SMCX 
acts at the terminal stage of the neuronal differentiation pathway, as 
demonstrated by its association with X-linked mental retardation, and 
SMCY is involved in meiosis during spermatogenesis' 131,132 '. 

PcG complexes 

PcG proteins were first discovered in Drosophila, in which 
they work in a reciprocal manner with TrxG proteins during embryo 
development' 49,50 ' and establish a heritable cellular memory system, 
the disruption of which affects normal embryonic development and 



eventually triggers neoplastic transformation' 1 '. 

Similar to TrxG proteins, PcG proteins form multimeric 
complexes with several other enzymes. To date, two distinct 
complexes, polycomb repressive complex (PRC)-1 and PRC-2, have 
been identified in humans (Figure 5). 

PcG complexes: structure and function 

B lymphoma Mo-MLV insertion region 1 homolog (Bmi-1) is 
the central component of PRC-1, which comprises one or more 
proteins harboring a chromobox domain (Cbx2, Cbx4, and Cbx8), 
one or more homologs of Drosophila polyhomeotic protein (PHC1, 
PHC2, and PHC3), and the RING1/RING1b proteins. In 1993, 
melanoma nuclear protein 18 (Mel-18), an additional member of the 
PRC-1 complex that is structurally related to Bmi-1 , was isolated and 
characterized' 135 '. Bmi-1 was originally identified as an oncogene that, 
when overexpressed, collaborates with c-Myc to induce the formation 
of B-cell lymphomas' 136 '. Bmi-1 is characterized by a RING domain, 
which is essential for its formation of a heterodimer with RING1/ 
RING1 b subunits and induction of the monoubiquitilation of lysine 
119 of histone H2A (H2AK119ub)' 137 '. The Cbx2, Cbx4, and Cbx8 
subunits are essential for the reading of the methylation marks 
set by PRC-2 (specifically, trimethylation of histone H3 at lysine 27), 
whereas PHC1 and its paralogs PHC2 and PHC3 are required to 
maintain the transcriptionally repressed state of the target gene' 138 '. 

PRC-2 (also known as the EED-EZH2 complex) contains three 
essential subunits: a catalytic subunit with methyltransferase activity, 
enhancer of zeste homolog 2 (EZH2), or, less frequently, its paralog 
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Figure 5. Polycomb repressive complex (PRC)-1 and PRC-2 work cooperatively. Enhancer of zesfe homolog 2 (EZM2), the catalytic submit of PRC-2 
with methyttransferase activity, mediates the covalent trimethylation of histone H3 at lysine 27 (H3K27me3). This specific histone modification serves as 
the docking site for proteins harboring a chromobox domain (especially Cbx4 and Cbx8) that aid in the recruitment of PRC-1 to chromatin through their 
interactions with H3K27me3. Thereafter, PRC-1 catalyzes the monoubiquitilation of histone H2A at lysine 119 through the enzymatic action of RING1/ 
RING1 b and blocks transcriptional elongation by RNA polymerase II. 



EZH1 , and two noncatalytic subunits, suppressor of zeste 12 (SUZ12) 
and embryonic ectoderm development (EED). Similar to TrxG 
proteins with methyltransferase activity, EZH2 and EZH1 contain 
a SET domain, which is essential for histone lysine methylation. In 
quiescent cells, EZH2, SUZ12, and EED are present at low levels, 
but their expression increases at the G,/S phase transition, when 
they up-regulate the expression of certain cell cycle-related factors 
and concomitantly inhibit the expression of apoptosis-related genes. 
In addition to these essential components, PRC-2 may contain other 
non-exclusive subunits, including RpbAp48 or RpbAp46, through 
which it can bind to certain ATP-dependent chromatin-remodeling 
complexes m] . 

PRC-1 and PRC-2 function cooperatively (Figure 5). Initially, 
EZH2, the PRC-2 catalytic subunit with methyltransferase activity, 
mediates the covalent trimethylation of histone H3 at lysine 27 
(H3K27me3)' 1391 . This specific histone modification serves as the 
docking site for proteins harboring a chromobox domain (especially 
Cbx4 and Cbx8) that facilitate the recruitment of PRC-1 to chromatin 
through their interactions with H3K27me3 modifications' 1401 . 
Thereafter, PRC-1 catalyzes the monoubiquitilation of lysine 119 at 
histone H2A through the enzymatic action of RING1/RING1 b and 
blocks the transcriptional elongation of RNA polymerase ll' 1411 . 

PcG complexes and malignant transformation 

Because of their pivotal role as epigenetic silencers, some 
components of PRC-2 and PRC-1 are involved in a variety of 



tumors. The overexpression of EZH2 has been associated with 
ovarian, cervical, colon, renal, oral, gastric, prostatic, and pancreatic 
cancers 1142 " 1431 , and genetic alterations including polymorphisms 
and inactivating mutations have been described in lung cancer, 
lymphoma, and myeloid neoplasms' 150,1511 . SUZ12 and EED have 
been found to be up-regulated in several human tumors, including 
tumors of the colon and liver' 1521 , malignant pleural mesotheliomas, 
mantle cell lymphoma, pulmonary carcinomas, and melanoma' 153-1541 . 
Similarly, Bmi-1 and RING1 overexpression are associated with 
ovarian, endometrial, cervical' 155 1561 , prostate, bladder, pancreatic, 
colon, and lung cancers' 148,157 " 1601 , different types of leukemias and 
lymphomas' 161,1621 , neuroblastoma' 1631 , and medulloblastoma' 1641 . 

In breast cancer, PcG proteins, especially EZH2, Bmi-1, and 
SUZ12, are significantly overexpressed even at the very earliest 
stages of neoplastic transformation' 165,1661 . The expression of EZH2 
is low or absent in morphologically normal lobules, whereas EZH2 
expression progressively increases with increasing severity of 
epithelial atypias' 1671 . In a very recent study, we found that atypical 
ductal hyperplasia (ADH) and ductal carcinoma in situ (DCIS) 
surgical specimens dramatically overexpressed the EZH2 gene 
compared with histologically normal breast tissue (127% in ADH and 
207% in DCIS). Notably, we found that EZH2 overexpression was 
associated with a well-differentiated, ER-positive luminal phenotype, 
as indicated by the high expression levels of ER, cytokeratin (CK)-18, 
CK-19, Mucin 1, and Gata-3, all of which are markers of a terminal 
luminal phenotype. Considering the physiologic turnover of mammary 
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epithelial cells during a woman's reproductive life (i.e., cyclical 
switching between differentiation and involution), this finding seems to 
suggest the persistence of a differentiated phenotype due to a defect 
in lobular involution. Furthermore, considering that lobular involution 
has been proved to be associated with a reduced risk of breast 
cancer and that epidemiologic studies have indicated that women 
with intraductal epithelial hyperplasia have a greater risk of breast 
cancer than women without this abnormality' 1681 , the association 
between EZH2 overexpression and increased breast cancer risk is 
not surprising. Unfortunately, the mechanism through which EZH2 
promotes epithelial hyperplasia is largely unknown. Preliminary 
experimental evidence indicates that the EZH2 protein interacts 
directly with ER and (3-catenin, linking estrogen activity and the Wnt/ 
(3-catenin signaling pathway and interfering in normal mammary 
gland differentiation 11691 . Consistent with the association between 
EZH2 overexpression and tumor aggressiveness, breast cancer 
subtypes that are defined as aggressive based on their molecular 
profile (basal-like, HER2-enriched tumors) or clinical markers (triple- 
negative tumors) show significantly higher EZH2 expression than 
subtypes (luminal A, luminal B, and ER*/HER2~) that are generally 
associated with a favorable prognosis' 1701 . 

Studies have demonstrated that Bmi-1 overexpression is also 
associated with aggressive phenotypes and poor prognosis' 1711 . 
The hypothesis that Bmi-1 overexpression increases cell motility 
and invasive capability is based on the observation that, in highly 
metastatic breast cancer cells, Bmi-1 repression markedly reduced 
cellular motility, invasion, and metastatic spread' 1721 . Bmi-1 is also 
thought to be involved in early stages of breast tumorigenesis by 
affecting the self-renewal and proliferation of normal mammary stem 
cells' 1731 . Studies have demonstrated that Bmi-1 represses the INK4A/ 
ARF locus, a well-known tumor suppressor region that encodes 
P16 lnk4a and P14 Arf , two cyclin-dependent kinase (Cdk) inhibitors' 1741 . 
P16 lnk4a affects retinoblastoma (Rb) protein activity by inhibiting the 
Cyclin D/Cdk4/6 complex. In the absence of Bmi-1, P16 lnk4a is up- 
regulated and prevents the binding of Cdk4/6 to Cyclin D, thus 
inhibiting Rb phosphorylation. Consequently, hypophosphorylated 
Rb binds to E2F, thus inhibiting the transcription of E2F target genes, 
including those involved in the G,-S transition. Meanwhile, P14 Arf 
stabilizes P53 by antagonizing mouse double minute 2 (MDM2) and 
activating P53-dependent transcription, which induces cell cycle 
arrest in or G 2 /M and subsequent apoptosis' 1751 . Experimental 
evidence has indicated that the Bmi-1 -induced down-regulation of 
P16 lnk4a and P14 Arf facilitates cell immortalization. Furthermore, in 
human mammary epithelial cells, the removal of P16 lnk4a activity 
causes EZH2 and SUZ12 to be up-regulated and recruited to 
HOXA9, a gene that is expressed during normal breast development 
but epigenetically silenced in breast cancer 11761 . Mel-18 is structurally 
related to but not functionally redundant with Bmi-1. The expression 
levels of these two genes are negatively correlated in several breast 
cancer cell lines' 1771 , and Mel-18 expression is a predictor of poor 
prognosis in patients with breast cancer' 1781 . Experimental findings 
have indicated that Mel-18 acts as a tumor suppressor by repressing 
Bmi-1 expression and down-regulating Akt activity' 179,1801 or competing 



with Bmi-1 for binding to the INK4A/ARF locus' 1811 . 
HOX genes 

As extensively described above, TrxG and PcG maintenance 
proteins are organized in multifactor complexes and play crucial 
roles in the epigenetic control of a large number of genes. They 
function as a part of a widely conserved cell memory system 
that prevents changes in cell identity by maintaining transcription 
patterns throughout development and in adulthood' 1821 . Therefore, 
any perturbation of the levels of TrxG and PcG proteins may 
have dramatic effects on cell identity, and studies have clearly 
demonstrated that the deregulated expression of genes involved in 
stem cell renewal or developmental regulation may trigger neoplastic 
transformation' 1831 . The deregulation of stem/progenitor cell self- 
renewal has also been proposed as a key event in mammary 
carcinogenesis' 184,1851 . 

Among the genes that play a direct role in cell fate specification 
and maintenance of cell identity, HOX genes are the best studied. 
During embryonic development, HOX gene expression controls the 
identity of various regions along the anterior-posterior axis of the 
embryo according to the rules of temporal and spatial collinearity, 
that is, HOX gene expression along the anterior-posterior axis is 
collinear with the 3' to 5' organization of the HOX genes on the 
chromosome' 1861 . HOX gene expression varies among normal 
adult organs. Indeed, each organ, including the mammary gland, 
shows a specific pattern of HOX gene expression that represents 
a unique molecular signature. Of the 39 human HOX genes, which 
are organized into four paralogous clusters (HOX loci A, B, C, 
D), 17 {HOXA3, HOXA7, HOXA8, HOXA9, HOXAW, HOXA11, 
HOXB3, HOXB5, HOXB6, HOXB7, HOXC6, HOXCW, HOXC11, 
HOXC13, HOXD1 HOXD8, and HOXD9) are expressed in the 
normal adult mammary gland' 1871 and are differentially expressed in 
breast cancer' 1881 . In particular, the HOX genes that are altered in 
breast cancer mostly correspond to paralogs 1-4 [HOXA1, HOXA3, 
HOXA5, HOXB2, HOXD3, and HOXD4) and 9-13 {HOXD10, 
HOXA11, HOXB13, and HOXD13). Among the altered HOX genes, 
HOXA5 seems to be the most important. Indeed, the HOXA5 protein 
level is decreased in nearly 70% of all breast carcinomas' 1891 , and 
experimental studies have indicated that compromised HOXA5 
function could limit P53 expression, thus affecting P53 tumor 
suppressor activity' 1901 . In addition, HOXA5 gene silencing due to 
selective hypermethylation of CpG islands in the promoter region 
also affects PR expression levels and consequently the terminal 
differentiation of the mammary epithelium' 191 ,192 l 

Despite the differential expression of HOX genes between 
normal and cancer tissues, nothing is known about the dynamics of 
HOX gene expression, especially during mammary gland turnover. 
However, because mammary gland development and differentiation 
depend upon the presence of ovarian hormones, it is reasonable to 
speculate that there may be a link between HOX gene expression 
and steroid hormones. Experimental evidence corroborates this 
hypothesis. Indeed, some studies have demonstrated that several 
HOX genes, including HOXCW and HOXC13, are transcriptionally 
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regulated by estrogen via several EREs in their promoter region 1 
Furthermore, HOXC6, which is involved in mammary gland 
development and milk production, was recently shown to be 
transcriptionally regulated by estrogen via two putative EREs within 
its promoter region' 1941 . Although no direct link between altered HOX 
gene expression and breast tumorigenesis has been established, it 
should be hypothesized that an abnormal concentration of steroid 
hormones, particularly estrogen, could affect HOX gene expression 
dynamics and contribute to cancer initiation or progression. Indeed, 
a misexpression of HOX genes could result in a different gene 
expression profile, leading to changes in cell differentiation or the 
adoption of an alternative cell identity (i.e., homeotic transformation). 

In a similar scenario, the TrxG and PcG maintenance proteins 
play crucial roles as master regulators of HOX gene expression. 
At least six independent genome-wide studies have identified 
common Polycomb targets in vertebrates and flies, most of which are 
homeobox genes or other developmental transcription factors' 1951 . In 
addition, it has been demonstrated that MLL2 and MLL3, the catalytic 
subunits of covalent histone-modifying complexes with H3K4-specific 
methyltransferase activities, coordinate with ER to transcriptionally 
regulate HOXC6 in an estrogen-dependent manner' 1941, and 
MLL3 knockdown suppresses the estrogen-induced activation of 
HOXC13' 1961 . 

Concluding Remarks 

During normal postnatal mammary gland development and 
remodeling during the menstrual cycle, pregnancy, and lactation, 
ovarian hormones and peptide growth factors contribute to the 
delineation of a definite cell identity by regulating tissue-specific 
gene expression. This gene expression is maintained during cell 
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